Dealing with repetitions in sequencing by hybridization
Abstract
DNA sequencing by hybridization (SBH) is subject to errors arising in the biochemical experiment. Some of them are random and disappear when the experiment is repeated. Others are systematic, involving repetitions in the probes of the target sequence. A good method for solving SBH problems must deal with both types of errors. In this work we propose a new hybrid genetic algorithm for isothermic and standard sequencing that incorporates the concept of structured combinations. The algorithm is then compared with other methods designed for handling errors that arise in standard and isothermic SBH approaches. DNA sequences used for testing are taken from GenBank. The set of test instances was divided into two groups. The first group consisted of sequences containing positive and negative errors in the spectrum, at a rate of up to 20%, excluding errors coming from repetitions. The second group consisted of sequences containing repeated oligonucleotides, with additional errors of up to 5% added to the spectra. Our new method outperforms the best alternative procedures for both data sets. Moreover, the method produces solutions exhibiting an extremely high degree of similarity to the target sequences in the cases without repetitions, which is an important outcome for biologists. The spectra prepared from the sequences taken from GenBank are available on our website at http://bio.cs.put.poznan.pl/.
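The two kinds of errors named in the abstract can be illustrated with a small sketch. It is not the authors' algorithm or their instance generator: it covers only the standard (fixed-length) case, the function names are hypothetical, and the way the error rate is applied (the same number of oligonucleotides removed and added) is an assumption made for illustration. It shows how a repeated oligonucleotide collapses into a single spectrum element, and how random negative and positive errors could be injected into a spectrum.

```python
import random
from collections import Counter


def ideal_spectrum(seq, l):
    """Set of all length-l oligonucleotides occurring in seq (standard SBH)."""
    return {seq[i:i + l] for i in range(len(seq) - l + 1)}


def repeated_oligos(seq, l):
    """Oligonucleotides occurring more than once in seq.

    The spectrum is a set, so each of these appears in it only once;
    the lost multiplicity is the systematic error caused by repetitions.
    """
    counts = Counter(seq[i:i + l] for i in range(len(seq) - l + 1))
    return {oligo for oligo, n in counts.items() if n > 1}


def add_errors(spectrum, l, rate, rng):
    """Inject random errors (assumed model, not taken from the paper).

    Removes k = round(rate * |spectrum|) elements (negative errors) and
    adds k oligonucleotides absent from the original spectrum (positive
    errors). Assumes 4**l is comfortably larger than |spectrum| + k.
    """
    k = round(rate * len(spectrum))
    removed = set(rng.sample(sorted(spectrum), k))
    added = set()
    while len(added) < k:
        candidate = "".join(rng.choice("ACGT") for _ in range(l))
        if candidate not in spectrum:
            added.add(candidate)
    return (spectrum - removed) | added, removed, added


seq = "ACGTACGA"          # toy sequence; ACG occurs twice
spec = ideal_spectrum(seq, 3)
noisy, removed, added = add_errors(spec, 3, 0.2, random.Random(0))
```

In this toy sequence there are six positions of length 3 but only five distinct oligonucleotides, so a reconstruction method that sees only the spectrum has to infer that ACG is used twice.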
Similar resources
Reconstruction of DNA sequencing by hybridization
MOTIVATION It is widely recognized that the hybridization process is prone to errors and that the future of DNA sequencing by hybridization is predicated on the ability to successfully cope with such errors. However, the occurrence of hybridization errors results in the computational difficulty of the reconstruction of DNA sequencing by hybridization. The reconstruction problem of DNA sequencin...
Dealing with errors in interactive sequencing by hybridization
MOTIVATION A realistic approach to sequencing by hybridization must deal with realistic sequencing errors. The results of such a method can surely be applied to similar sequencing tasks. RESULTS We provide the first algorithms for interactive sequencing by hybridization which are robust in the presence of hybridization errors. Under a strong error model allowing both positive and negative hyb...
Bulletin of the Polish Academy of Sciences
In this paper a greedy algorithm for some variants of the sequencing by hybridization method is presented. In the standard version of the method information about repetitions is not available. In the paper it is assumed that a partial information of this type is a part of the problem instance. Here two simple but realistic models of this information are taken into consideration. The first one a...
Microduplication of Xp22.31 and MECP2 Pathogenic Variant in a Girl with Rett Syndrome: A Case Report
Rett syndrome (RS) is a neurodevelopmental infantile disease characterized by an early normal psychomotor development followed by a regression in the acquisition of normal developmental stages. In the majority of cases, it leads to a sporadic mutation in the MECP2 gene, which is located on the X chromosome. However, this syndrome has also been associated with microdeletions, gene translocations...
Sequencing by hybridization in the presence of hybridization errors.
DNA sequencing is a very important problem in genomics. Several different sequencing methods are currently utilized. One promising method uses a sequencing chip to obtain information about the presence of subsequences in DNA. This paper deals with sequencing of hybridization data from a sequencing chip, called Sequencing by Hybridization (SBH). Preparata et al. proposed a new sequencing chip us...
Journal title:
- Computational biology and chemistry
Volume 30, Issue 5
Pages -
Publication date: 2006